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Abstract 

An overview of the recursive equations based models and tlieir ap- 
plications in simulation based analysis and optimization of queueing 
systems is given. These models provide a variety of systems with a 
convenient and unified representation in terms of recursions for arrival 
and departure times of customers, which involves only the operations 
of maximum, minimum, and addition. 

1 Introduction 

As a representation of dynamics of queueing systems, recursive equations 
have been introduced by Lindley 1952 in his investigation of the G/G/1 
queue. The representation has proved to be useful in both analytical study 
and simulation of queues, and was extended to cover a variety of queueing 
systems including open and closed tandem single-server queues with both 
infinite and finite buffers, the G/G/m system, and queueing networks. 

The recursive equations were originally expressed in terms of the wait- 
ing times of customers ([HIS])- Equations following this classical approach 
remain traditional in the queueing theory, one can find them in many of the 
recent works devoted mainly to theoretical aspects of the investigation of 
queueing systems (see, e.g., |3]). 

In the last few years, another representation based on recursions for the 
arrival and departure times of customers has gained acceptance in works 
dealing with the simulation study of queueing systems and its related fields 
including performance evaluation and sensitivity analysis. The items of our 
list of references, other than those cited above, can serve as an illustration. 
Although these equations may be readily derived from those of the classical 
type, they offer a more convenient and unified way of representing dynamics 
of queueing systems as well as their performance measures. 
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The purpose of this paper is to give a brief overview of the recursive 
equations and their appHcations in simulation based analysis and optimiza- 
tion of queueing systems. The subsequent sections present the equations 
expressed in terms of the arrival and departure times, which describe the 
systems most commonly encountered in studies of queues, and discuss the 
representation of performance measures, associated with these queueing sys- 
tem models. Applications of the models to the development of simulation 
algorithms as well as to the analysis of system performance measures and 
estimation of their sensitivity are also outlined. Finally, limitations on the 
use of the models are briefly discussed. 

2 Models of Queueing Systems 

Most of the models appearing in this section actually present single-server 
systems which can have both finite and infinite buffers, and operate accord- 
ing to the first-come, first-served (FCFS) queueing discipline. Also included 
are the equations representing the G/G/m system, and a rather general 
model of a queueing network with a deterministic routing mechanism. 



We start with this model which provides the basis for representing more 
complicated queueing systems. The G/G/1 system consists of a server and 
a buffer with infinite capacity (Fig.l). Once a customer arrives into the 
system, he occupies the server provided that it is free. If the customer finds 
the server busy, he is placed into the buffer, and starts waiting to be served. 
The queue discipline in the system is presumed to be FCFS. 



For the G/G/1 queue, we denote the interarrival time between the A;th 
customer and his predecessor by ct^, and the service time of the feth cus- 
tomer by Tk ■ Furthermore, let Aj^ be the fcth arrival epoch to the queue, 
and Dk be the feth departure epoch from the queue, = 1, 2, . . . . Provided 
that the system starts operating at time zero, it is convenient to set = 
and £)o = 0. One may now describe the dynamics of the G/G/1 queue as 



2.1 The G/G/l Queue 



Dk 




Figure 1: The G/G/1 queue. 



Ak = Ak-i + Qfc 

Dk = {Ak^ Dk-i)+Tk, 
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where V stands for the maximum operator, k = 1,2, . 



2.2 Tandem Systems of Single-Server Queues 

Consider a system of N single-server queues with infinite buffers, operating 
in tandem as shown in Fig. 2. 
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Figure 2: Single-server queues in tandem. 

Each customer that arrives into this system has to pass through all the 
queues so as to occupy consecutively every servers from 1 to A^, and then 
leave the system. We suppose that upon his service completion at a queue, 
the customer arrives into the next queue immediately. 

To set up the recursive equations representing the system in a convenient 
way, let us introduce the symbols and tJ} respectively for the departure 
and service times of the kth. customer at queue n. However, we maintain the 
symbols and Dk = to denote the kth arrival and departure epochs 
for the whole system. With these notations, the equations are written as 
(Shanthikumar and Yao 1989a; and Chen and Chen 1990) 

Dl = {AkVDl_,) + Tl 

Dl = {Dl-^yDl_^) + T^, n = 2,...,N. 

2.2.1 Closed Tandem Systems 

Suppose that in the above tandem system all the customers after their ser- 
vice completion at the A'^th server return to the 1st queue for the next cycle 
of service (see Fig. 3). 

1 2 N 

^□o— DO ^Do^ 



Figure 3: A closed tandem system of single-server queues. 

Furthermore, we assume that at the initial time, there are -ftr„, < 
Kn < CO, customers in the buffer of server n. Assuming = —oo for all 
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A; < and n = 1, . . . A^, one can represent the closed system in the form 
(see, e.g., [11[5]) 

Dl = Pri-„VDLl)+rfc^ n = 2,...,Ar. 
2.2.2 Tandem Queues with Finite Buffers 

We now turn to the discussion of the system of queues which provide only 
a limited number of places in their buffers for customers waiting for service. 
In such a system, if the buffer at a server has finite capacity, the preceding 
server may be blocked according to one of the blocking rules. In this paper 
we shall restrict ourselves to manufacturing blocking and communication 
blocking which are more frequent in practice. 

Consider a system of N queues, depicted in Fig.4. We denote the ca- 
pacity of the buffer at server n by Bn-, < i?„ < cxd, n = 2, 3, . . . , A^. As 
the input buffer of the system, the buffer of the 1st server is assumed to be 
infinite. 
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Figure 4: Tandem single-server queues with finite buffers. 

Let us first suppose that the system operates according to the manu- 
facturing blocking rule. Under this type of blocking, if a customer upon 
completion of his service at server n sees the buffer of the (n -|- l)st server 
full, he cannot unoccupy the nth server until the next server provides a free 
space in its buffer. Since buffers become free as customers are called for- 
ward for service, the nth server is unoccupied as soon as the (n + l)st server 
completes its current service to initiate the service of the next customer. It 
is not difficult to understand that the departure of the A;th customer from 
server n occurs not earlier than that of the {k — Bn+i — l)st customer from 
server n + 1. Taking into account this condition, one may represent the 
equations as ( O [7] ) 

Dl = {iAkVDl_,) + Tl)\/Dl_s^_, 

Dl = {(Dl-'yDl^,) + Tj:)yDltl^^^^„ n = 2,...,A^-l 
D^ = {Dj:-'vDti) + r^. 

The communication blocking rule requires from a server not to initiate 
the service of a customer if the buffer of the next server is full. In this case, 
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the server remains unavailable until the current service at the next server 
is completed. For the system with communication blocking, we have (Chen 
and Chen 1990) 

Dl = {Dl-'yDl_,yDltl,^^^_,) + T^, n = 2,...,iV-l 
2.3 G/G/m Queues 

Equations representing the G/G/m queue (Fig. 5) as recursions for the wait- 
ing times of customers have been first introduced by Kiefer and Wolfowitz 
1955 [2]. These recursive equations were expressed in general terms rather 
than in an explicit form similar to those presented above. 




Figure 5: The G/G/m queue. 

To represent the equations for the G/G/m queue, 1 < m < oo, in 
terms of the arrival and departure times of customers, let us further insert 
the symbol for the service completion time of the customer which is 
the kth. to arrive into the system. Note that in multi-server queues the 
feth departure time and the completion time of the kih. customer may not 
coincide as contrasted to the G/G/1 queue which does not recognize them. 

Now we may describe the dynamics of the system through the equations 
proposed in [8] (a similar representation in terms of waiting times can be 
found in [9j) 

Ak = Ak-i + Ofc 

Ck = {Ak\/ Dk-m) + Tk 

Dk = f\ (Cj, V--- VCjJ ACfc+^_i, 

l<ji<...<jk<k+m-2 

where A signifies the minimum operator. Note that with m = 1 the above 
set of equations is reduced to that of the G/G/1 queue. 

2.4 Networks with Deterministic Routing 

We complete this section by presenting a rather general model of a closed 
queueing network with deterministic routing described in [lOl [11] (see also a 
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similar model in [T2] ) . Let us first consider a network consisting of N single- 
server nodes. In each node there are a server and an infinite buffer in which 
customers are placed at their arrival so as to wait for service if it cannot 
be initiated immediately. After his service completion at one node, each 
customer goes to another node chosen according to the routing procedure 
defined as follows. For the network, we assume that a matrix 

/ sii si2 ■■■ sik ■■■ \ 

S2l S22 ■ ■ ■ S2k ■ ■ ■ 
\ SNl SN2 ■ ■ ■ SNk ■ ■ ■ J 

is given, Snk determines the next node to be visited by the customer who 
is the kth to depart from node n, Snk £ {li • • • > ^} ; n = 1, . . . , N ; k = 
1,2, ... . It is also assumed that at the initial time, all servers are free, and 
there are Kn , < Kn < 00 , customers in the buffer at node n . 

For node n, we denote the kth arrival and departure epochs respectively 
by and , and the service time of the customer who is the A;th to arrive 
by . Furthermore, let us introduce the set 

Dn = {D'llsik = n;i = I, ... ,N;k = 1,2, .. .}, 

which is constituted by the departure times of the customers who have to go 
to node n . Finally, we may represent the network by means of the equations 

D^, = {AlyDl_,)+Tl^ 

^ f 0, \ik<Kn 

\ -A^-Kn^ otherwise, 

where is the arrival time of the customer which is the fcth to arrive into 
node n after his service at any node of the network. In other words, the 
symbol A!^ differs from in that it relates only to the customers really 
arriving into node n, and does not to those occurring in this node at the 
initial time. It is defined as 

Al= l\ {Diy---\/Dk), 

{Di,...,Dk}cD„ 

where minimum is taken over all fc -subsets of the set Dn . 

It is easy to understand how tandem queueing systems with infinite 
buffers may be represented as networks like that just described. Moreover, 
changing the first one from the above equations, one can readily extend the 
model to cover networks with nodes which may have many servers. These 
servers may operate both in tandem and in parallel, and even form a network 
themselves. 
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3 Performance Measures 



We are now in a position to show how the performance measures which one 
normahy chooses in the analysis of queueing systems may be represented on 
the basis of the models described above. We start with presenting sample 
performance measures associated with the systems under consideration, and 
then briefly discuss the evaluation of system performance measures in the 
general case. 

3.1 Networks with Single-Server Nodes 

Suppose that we observe the network until the Kth service completion at 
node n, K = 1,2, . . 1 < n < N . As sample performance measures for 
node n in the observation period, one can consider the following average 
quantities ([3 [TOl [II]): 

K 

^^(I?^ — A'^)/K, the total time of one customer; 

k=l 
K 

^^(D^ — — tJ})/K; the waiting time of one customer; 

k=l 

K/D"^, the throughput rate of the node; 

K 

/ , the utilization of the server; 

fe=l 

K 

^^(D^ — A^)/D'^, the number of customers at the node; 

k=l 
K 

'^{D'k - Al - T^)/-D^, the queue length at the node. 

k=l 

Note that, assuming the service times to be given, one can express 
these measures in closed form only in terms of these times, involving arith- 
metic operations, and the operations of maximum and minimum 

3.2 Tandem Systems of Queues 

Since tandem systems can be considered as networks with deterministic rout- 
ing, the above sample performance measures are also suited to the tandem 
systems. In addition to these measures which are actually server related 
performance criteria, for a tandem system with N servers we may define 
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customer related performance measures [7] 



K 



Sk 



Ak)/K, the average system time of one customer; 



k=l 



Wk 




Ak — y tJ} \ /K, the average waiting time of one customer. 



fc=i \ 



n=l 



Finahy, there are sample performance measures inherent only in the 
systems with finite buffers. As an example, the average idle time of a server, 
say server n, can be considered. This measure is written in the same form 
for both the manufacturing and communication blocking rules as 



3.3 Multi-Server Queues 

Sample performance measures in multi-server queueing systems can be rep- 
resented through formulas which are closely similar to those applied in 
queueing networks. For instance, the average throughput may be defined 
exactly as we have defined T^. To represent properly the remaining mea- 
sures, one however has to take into account the distinction between the kth 
completion and the kth departure times, involved in the G/G/m queue. 
With this distinction, replacing the symbols Dk by Ck is required in the 
above formulas so as to provide appropriate expressions for the sample per- 
formance measures of multi-server queues. 

3.4 Evaluation of System Performance 

We suppose now that the service times Tk (and the interarrival times ak if 
they are given) are defined as random variables = t^^OjOo), where 6 £ Q 
is a set of decision parameters, and a; is a random vector. In this case, as 
it results from the above representations of the queueing systems and their 
performance, the arrival epochs Ak and the departure epochs Dk , together 
with the sample performance measures also present random variables. Let 
Fx = Fk{0,uj), be a sample performance measure of the system. As is 
customary, we define the system performance measure associated with Fx 
by the expected value 



Based on a finite observation period. Fx is generally referred to as finite- 
horizon performance measure. Another criterion, a steady-state performance 



K 



Ik = Y1 - (^r ' V z?Lj - tj:) /k. 



k=l 



Fxie) = E^[FK{e,uj)]. 
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measure, intended to describe a long time behaviour of a system is defined 
as 

F{e)= lim E^[FK{e,u:)\. 

K^oo 

Although we may express sample performance measures in closed form, 
in the case of general random variables determining the service times of 
customers, it is often very difficult or even impossible to obtain analytically 
the criteria Fk and especially F . In this situation, one generally applies 
a simulation technique which allows of obtaining values of Fk{0,uj), and 
then estimates the system performance by using the Monte Carlo approach. 
Note however, that information concerning the explicit form of the sample 
performance measures normally proves to be very useful to the simulation 
study and optimization of queueing systems. 

4 Application of the Models 

In this section we briefly outline a selection of the application areas of the 
recursive representation in simulation based analysis and optimization of 
queueing systems. The section concludes with remarks concerning limita- 
tions on the use of the models in representing queueing systems. 

4.1 Design of Simulation Algorithms 

Since recursive equations determine a global structure of changes in queueing 
systems consecutively in a very natural way, they provide the basis for the 
development of very efficient simulation procedures (see, e.g., [3131113]). Al- 
though the simulation technique based on recursive representations of queue- 
ing systems may rank below the traditional event-scheduling approach in its 
versatility, the algorithms applying this technique are normally superior to 
others in reducing time and memory costs. Moreover, these algorithms are 
usually best suited to the implementation on parallel and vector processors. 
As an illustration, one can consider parallel simulation algorithms in [Hll3|. 

4.2 Variance Reduction in Simulation 

Closely related to the queueing system simulation procedures are variance 
reduction techniques which are intended to improve the accuracy of simula- 
tion output ([H]). In order for a variance reduction method to be success- 
fully employed in estimating a system performance F^ , certain conditions 
normally have to be imposed on its associated sample performance measure 
Fk ■ Specifically, the antithetic variates method and the common random 
numbers method require that Fx as a function of the random argument 
u be monotone (see, e.g., [H])- Examples of establishing such monotonic- 
ity properties from the recursive representation of queueing systems can be 
found in [15j . 
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4.3 Investigation of System Performance Measures 

Another area of applications of the models includes the investigation of 
properties inherent in performance measures of queueing systems, such as 
monotonicity and convexity with respect to system parameters . It is nor- 
mally not difficult to examine these properties for the systems described by 
equations involving only the operations of maximum and addition (e.g., tan- 
dem queues with both infinite and finite buffers). One can find an extended 
discussion of this subject in [6l [T2l [T6]. 

4.4 Sensitivity Analysis and Estimation 

Since there are generally no explicit representations as functions of sys- 
tem parameters 6 available for the performance measure, one may evaluate 
its sensitivity (or its gradient, when the parameters are continuous) by no 
way other than through the use of estimates obtained from simulation ex- 
periments. Very efficient procedures of obtaining gradient estimates may 
be designed using new technique called infinitesimal perturbation analysis 
(IPA) (see, e.g., |17)). The IPA algorithms which are actually based on the 
recursive representations of queueing systems, can serve as an important 
line of the application of the models under discussion ([TSl E]). Finally, 
these models provided a useful framework for examining unbiasedness and 
consistency of IPA estimates in [T21 [ISl [13 E] • 

4.5 Limitations on the Use of the Models 

One can see that the general model of the network, as it has been presented 
above, treats of queueing systems from the viewpoint of service facilities 
rather than of particular customers. Specifically, for each node n only the 
arrival and departure instants are essential, whereas it makes no difference 
which of the customers proves to arrive or to depart. Moreover, both times 
and do not need to be associated with a single customer, as it 
normally happens in nodes with many servers operating in parallel. As a 
consequence, the models do not allow of representing systems with many 
classes of customers through recursive equations in closed form. Finally, 
since nodes do not distinguish among customers in some sense, the order in 
which customers are selected from a queue for service is of no concern, and 
therefore, these models are incapable of identifying distinct queue disciplines. 
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